Spectral clustering for divide-and-conquer graph matching

Authors

  • Vince Lyzinski
  • Daniel L. Sussman
  • Donniell E. Fishkind
  • Henry Pao
  • Li Chen
  • Joshua T. Vogelstein
  • Youngser Park
  • Carey E. Priebe
Abstract

We present a parallelized bijective graph matching algorithm that leverages seeds and is designed to match very large graphs. Our algorithm combines spectral graph embedding with existing state-of-the-art seeded graph matching procedures. We justify our approach by proving that modestly correlated, large stochastic block model random graphs are correctly matched utilizing very few seeds through our divide-and-conquer procedure. We also demonstrate the effectiveness of our approach in matching very large graphs in simulated and real data examples.

V. Lyzinski, D.L. Sussman, D.E. Fishkind, H. Pao, L. Chen, J.T. Vogelstein, Y. Park, C.E. Priebe, “Spectral Clustering for Divide-and-Conquer Graph Matching,” Parallel Computing, accepted for publication, 2015.

Background

Given two graphs, $G_1 = (V_1, E_1)$ and $G_2 = (V_2, E_2)$, the Graph Matching Problem (GMP) seeks an alignment between the vertex sets $V_1$ and $V_2$ that best preserves structure across the graphs. In bijective graph matching, we further assume $|V_1| = |V_2| = n$, and the alignment sought by GMP is a bijection between $V_1$ and $V_2$.

Graph Matching Problem

Find a bijection $\psi : V_1 \to V_2$ minimizing the quantity

$$\Big|\big\{(i, j) \in V_1 \times V_1 \ \text{s.t.}\ [\, i \sim_{G_1} j,\ \psi(i) \not\sim_{G_2} \psi(j) \,] \ \text{or}\ [\, i \not\sim_{G_1} j,\ \psi(i) \sim_{G_2} \psi(j) \,]\big\}\Big|, \tag{1}$$

i.e. the problem seeks to minimize the number of edge disagreements between $G_2$ and “$\psi(G_1)$”. Equivalently stated, if $A$ and $B$ are the respective adjacency matrices of $G_1$ and $G_2$, then this problem seeks to minimize $\|A - PBP^T\|_F$ over all permutation matrices $P \in \Pi(n) := \{n \times n \text{ permutation matrices}\}$, with $\|\cdot\|_F$ the matrix Frobenius norm.

In the seeded graph matching problem (SGMP), we further assume the presence of a latent alignment $\phi$ between the vertex sets of $G_1$ and $G_2$. Our task is then to efficiently leverage the information in a partial observation of the latent alignment, i.e. a seeding, to estimate the remaining latent alignment.

Seeded Graph Matching Problem

Given subsets of the vertices $S_1 \subset V_1$ and $S_2 \subset V_2$ called seeds, with $|S_1| = |S_2| = s$, and a bijective seeding function $\phi_S : S_1 \to S_2$, the task is to use $\phi_S$ to estimate $\phi$ by finding the bijection extending $\phi_S$ which minimizes (1).
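The two problem statements above can be illustrated on a toy example. The following is a minimal sketch, not the authors' algorithm: it evaluates objective (1) directly, checks it against the Frobenius-norm form, and solves the seeded problem by exhaustive search over the unseeded vertices, which is only feasible for a handful of vertices. The function names, the 5-vertex path graph, and the chosen seeds are all assumptions made for illustration.

```python
import itertools
import numpy as np


def edge_disagreements(A, B, psi):
    """Objective (1): number of ordered pairs (i, j) whose adjacency in G1
    differs from the adjacency of (psi(i), psi(j)) in G2."""
    psi = np.asarray(psi)
    B_perm = B[np.ix_(psi, psi)]  # entry (i, j) equals B[psi[i], psi[j]]
    return int(np.sum(A != B_perm))


def frobenius_objective(A, B, psi):
    """||A - P B P^T||_F for the permutation matrix P with P[i, psi[i]] = 1."""
    n = len(psi)
    P = np.zeros((n, n), dtype=int)
    P[np.arange(n), np.asarray(psi)] = 1
    return float(np.linalg.norm(A - P @ B @ P.T, "fro"))


def seeded_match_bruteforce(A, B, seeds):
    """Exhaustive seeded matching (illustrative only, factorial cost).

    seeds: dict mapping each seed vertex of G1 to its known image in G2.
    Returns the bijection extending the seeds that minimizes (1), and its cost.
    """
    n = A.shape[0]
    free1 = np.array([v for v in range(n) if v not in seeds], dtype=int)
    used2 = set(seeds.values())
    free2 = [v for v in range(n) if v not in used2]
    best_psi, best_cost = None, None
    for perm in itertools.permutations(free2):
        psi = np.empty(n, dtype=int)
        for v, w in seeds.items():
            psi[v] = w                      # seeded vertices are fixed
        psi[free1] = np.array(perm, dtype=int)  # candidate for the rest
        cost = edge_disagreements(A, B, psi)
        if best_cost is None or cost < best_cost:
            best_psi, best_cost = psi, cost
    return best_psi, best_cost


# Toy data (assumed): G1 is the path 0-1-2-3-4, G2 is a relabelled copy.
n = 5
A = np.zeros((n, n), dtype=int)
for i in range(n - 1):
    A[i, i + 1] = A[i + 1, i] = 1
phi = np.array([2, 0, 3, 1, 4])      # latent alignment V1 -> V2
B = np.zeros((n, n), dtype=int)
B[np.ix_(phi, phi)] = A              # so that B[phi[i], phi[j]] = A[i, j]

identity = np.arange(n)
cost_identity = edge_disagreements(A, B, identity)
frob_identity = frobenius_objective(A, B, identity)

seeds = {0: 2, 1: 0}                 # partial observation of phi
psi_hat, cost_hat = seeded_match_bruteforce(A, B, seeds)
```

For 0/1 adjacency matrices the squared Frobenius norm equals the disagreement count in (1), which is why the two formulations share their minimizers. The exhaustive search stands in for the seeded graph matching procedures the abstract refers to; it is used here only because it makes the definition of "the bijection extending the seeding that minimizes (1)" concrete.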


Similar resources

Clustering in WSN Based on Minimum Spanning Tree Using Divide and Conquer Approach

Due to heavy energy constraints in WSNs, clustering is an efficient way to manage energy in sensors. Many clustering methods have already been proposed, and research continues on making clustering more energy efficient. In our paper we propose a minimum spanning tree based clustering using a divide and conquer approach. The MST based clustering was first proposed in 1...


Clustering for Data Reduction: A Divide and Conquer Approach

We consider the problem of reducing a potentially very large dataset to a subset of representative prototypes. Rather than searching over the entire space of prototypes, we first roughly divide the data into balanced clusters using bisecting k-means and spectral cuts, and then find the prototypes for each cluster by affinity propagation. We apply our algorithm to text data, where we perform an ...


A Divide and Conquer Framework for Distributed Graph Clustering

Graph clustering is about identifying clusters of closely connected nodes, and is a fundamental technique of data analysis with many applications including community detection, VLSI network partitioning, collaborative filtering, etc. In order to improve the scalability of existing graph clustering algorithms, we propose a novel divide and conquer framework for graph clustering, and establish th...


A hybrid ensemble approach for the Steiner tree problem in large graphs: A geographical application

Hybrid approaches are often recommended for dealing in an efficient manner with complex problems that require considerable computational time. In this study, we follow a similar approach consisting of combining spectral clustering and ant colony optimization in a two-stage algorithm for the purpose of efficiently solving the Steiner tree problem in large graphs. The idea of the two-stage approa...


Free Vibration Analysis of Repetitive Structures using Decomposition, and Divide-Conquer Methods

This paper consists of three sections. In the first section, an efficient method is used for decomposition of the canonical matrices associated with repetitive structures. To this end, a cylindrical coordinate system as well as a special numbering scheme were employed. In the second section, the divide and conquer method has been used for eigensolution of these structures, where the matrices are in ...



Journal title:
  • Parallel Computing

Volume 47  Issue 

Pages  -

Publication date 2015